Search CORE

37 research outputs found

On the Expressive Power of Regular Expressions with Backreferences

Author: Nogami Taisei
Terauchi Tachio
Publication venue
Publication date: 08/08/2023
Field of study

A rewb is a regular expression extended with a feature called backreference. It is broadly known that backreference is a practical extension of regular expressions, and is supported by most modern regular expression engines, such as those in the standard libraries of Java, Python, and more. Meanwhile, indexed languages are the languages generated by indexed grammars, a formal grammar class proposed by A.V.Aho. We show that these two models' expressive powers are related in the following way: every language described by a rewb is an indexed language. As the smallest formal grammar class previously known to contain rewbs is the class of context sensitive languages, our result strictly improves the known upper-bound. Moreover, we prove the following two claims: there exists a rewb whose language does not belong to the class of stack languages, which is a proper subclass of indexed languages, and the language described by a rewb without a captured reference is in the class of nonerasing stack languages, which is a proper subclass of stack languages. Finally, we show that the hierarchy investigated in a prior study, which separates the expressive power of rewbs by the notion of nested levels, is within the class of nonerasing stack languages.Comment: 20 pages, the full version of the paper to appear in MFCS 202

arXiv.org e-Print Archive

On the Expressive Power of Regular Expressions with Backreferences

Author: Nogami Taisei
Terauchi Tachio
Publication venue: LIPIcs - Leibniz International Proceedings in Informatics. 48th International Symposium on Mathematical Foundations of Computer Science (MFCS 2023)
Publication date: 01/01/2023
Field of study

A rewb is a regular expression extended with a feature called backreference. It is broadly known that backreference is a practical extension of regular expressions, and is supported by most modern regular expression engines, such as those in the standard libraries of Java, Python, and more. Meanwhile, indexed languages are the languages generated by indexed grammars, a formal grammar class proposed by A.V.Aho. We show that these two models\u27 expressive powers are related in the following way: every language described by a rewb is an indexed language. As the smallest formal grammar class previously known to contain rewbs is the class of context sensitive languages, our result strictly improves the known upper-bound. Moreover, we prove the following two claims: there exists a rewb whose language does not belong to the class of stack languages, which is a proper subclass of indexed languages, and the language described by a rewb without a captured reference is in the class of nonerasing stack languages, which is a proper subclass of stack languages. Finally, we show that the hierarchy investigated in a prior study, which separates the expressive power of rewbs by the notion of nested levels, is within the class of nonerasing stack languages

Dagstuhl Research Online Publication Server

Answer Refinement Modification: Refinement Type System for Algebraic Effects and Handlers

Author: Kawamata Fuga
Sekiyama Taro
Terauchi Tachio
Unno Hiroshi
Publication venue
Publication date: 28/07/2023
Field of study

Algebraic effects and handlers are a mechanism to structure programs with computational effects in a modular way. They are recently gaining popularity and being adopted in practical languages, such as OCaml. Meanwhile, there has been substantial progress in program verification via refinement type systems. However, thus far, there has not been a satisfactory refinement type system for algebraic effects and handlers. In this paper, we fill the void by proposing a novel refinement type system for algebraic effects and handlers. The expressivity and usefulness of algebraic effects and handlers come from their ability to manipulate delimited continuations, but delimited continuations also complicate programs' control flow and make their verification harder. To address the complexity, we introduce a novel concept that we call answer refinement modification (ARM for short), which allows the refinement type system to precisely track what effects occur and in what order when a program is executed, and reflect the information as modifications to the refinements in the types of delimited continuations. We formalize our type system that supports ARM (as well as answer type modification) and prove its soundness. Additionally, as a proof of concept, we have implemented a corresponding type checking and inference algorithm for a subset of OCaml 5, and evaluated it on a number of benchmark programs. The evaluation demonstrates that ARM is conceptually simple and practically useful. Finally, a natural alternative to directly reasoning about a program with delimited continuations is to apply a continuation passing style (CPS) transformation that transforms the program to a pure program. We investigate this alternative, and show that the approach is indeed possible by proposing a novel CPS transformation for algebraic effects and handlers that enjoys bidirectional (refinement-)type-preservation.Comment: 66 page

arXiv.org e-Print Archive

Work Analysis with Resource-Aware Session Types

Author: Carbonneaux Quentin
Griffith Dennis
Haemmerlé Rémy
Hofmann Martin
Pfenning Frank
Silva Miguel
Terauchi Tachio
Willsey Max
Çiçek Ezgi
Publication venue
Publication date: 26/04/2018
Field of study

While there exist several successful techniques for supporting programmers in deriving static resource bounds for sequential code, analyzing the resource usage of message-passing concurrent processes poses additional challenges. To meet these challenges, this article presents an analysis for statically deriving worst-case bounds on the total work performed by message-passing processes. To decompose interacting processes into components that can be analyzed in isolation, the analysis is based on novel resource-aware session types, which describe protocols and resource contracts for inter-process communication. A key innovation is that both messages and processes carry potential to share and amortize cost while communicating. To symbolically express resource usage in a setting without static data structures and intrinsic sizes, resource contracts describe bounds that are functions of interactions between processes. Resource-aware session types combine standard binary session types and type-based amortized resource analysis in a linear type system. This type system is formulated for a core session-type calculus of the language SILL and proved sound with respect to a multiset-based operational cost semantics that tracks the total number of messages that are exchanged in a system. The effectiveness of the analysis is demonstrated by analyzing standard examples from amortized analysis and the literature on session types and by a comparative performance analysis of different concurrent programs implementing the same interface.Comment: 25 pages, 2 pages of references, 11 pages of appendix, Accepted at LICS 201

arXiv.org e-Print Archive

Crossref

Dependent types from counterexamples

Author: Ball T.
Clarke E.M.
Johnsson T.
McMillan K.L.
Tachio Terauchi
Publication venue: 'Association for Computing Machinery (ACM)'
Publication date
Field of study

Crossref

Quantitative Information Flow as Safety and Liveness Hyperproperties

Author: Abramsky
Backes
Backes
Barthe
Braun
Clark
Clark
Clarkson
Clarkson
Cohen
Darvas
Denning
Flanagan
Goguen
Herbert Wiklicky
Heusser
Hirotoshi Yasuoka
Köpf
Köpf
Köpf
Leino
Malacaria
Malacaria
Massey
McCamant
Mieke Massink
Naumann
Newsome
Shannon
Smith
Tachio Terauchi
Terauchi
Unno
Yasuoka
Yasuoka
Yasuoka
Černý
Publication venue: 'Open Publishing Association'
Publication date: 01/07/2012
Field of study

We employ Clarkson and Schneider's "hyperproperties" to classify various verification problems of quantitative information flow. The results of this paper unify and extend the previous results on the hardness of checking and inferring quantitative information flow. In particular, we identify a subclass of liveness hyperproperties, which we call "k-observable hyperproperties", that can be checked relative to a reachability oracle via self composition.Comment: In Proceedings QAPL 2012, arXiv:1207.055

arXiv.org e-Print Archive

Crossref

Directory of Open Access Journals

Dependent Types from Counterexamples

Author: Tachio Terauchi
Publication venue
Publication date: 01/01/2009
Field of study

Motivated by recent research in abstract model checking, we present a new approach to inferring dependent types. Unlike many of the existing approaches, our approach does not rely on programmers to supply the candidate (or the correct) types for the recursive functions and instead does counterexample-guided refinement to automatically generate the set of candidate dependent types. The main idea is to extend the classical fixed-point type inference routine to return a counterexample if the program is found untypable with the current set of candidate types. Then, an interpolating theorem prover is used to validate the counterexample as a real type error or generate additional candidate dependent types to refute the spurious counterexample. The process is repeated until either a real type error is found or sufficient candidates are generated to prove the program typable. Our system makes non-trivial use of “linear” intersection types in the refinement phase. The paper presents the type inference system and reports on the experience with a prototype implementation that infers dependent types for a subset of the Ocaml language. The implementation infers dependent types containing predicates from the quantifierfree theory of linear arithmetic and equality with uninterpreted function symbols

CiteSeerX

Crossref